Deep Learning Compiler

Why: to be able to holistically evaluate and optimize the entire network, including across-layer optimizations, which cannot be done by interpreters
What: compiler that translate a computational graph to target-specific kernels that can be executed in the target platform
How:
1. Graph Level Optimizations
2. Layout Optimization
3. Kernel Selection and Kernel Generation
4. Scheduling
5. Tensor Allocation
How good: out-of-the-box high performance

	Traditional Compiler	DLC
Input	High-level languages	computational graph
Output	Low-level languages like asm	kernels